Express Mail No.: EV338034347US PATENT APPLICATION 

Attorney Docket No. 44500-00342 

INDUSTRIAL PROCESS FAULT DETECTION 
USING PRINCIPAL COMPONENT ANALYSIS 

FIELD OF THE INVENTION 
5 The present invention relates generally to the field of evaluating the progress and 

efficacy of industrial processes by analyzing (in real time) the data streams produced by 
sensors that monitor the processes, and more particularly to monitoring/evaluating 
plasma processes useful in the fabrication of semiconductor microelectronic circuits, 
microelectromechanical devices, and the like. 

10 

BACKGROUND OF THE INVENTION 
Industrial processes used for the manufacture of a wide variety of valuable goods 
are often automated such that the processing equipment performs without direct control 
of human operators. Sensors are typically utilized in such situations to measure aspects 

15 of the processing operation, either for direct feedback control, or for confirmation that 
processing conditions are within established norms. 

Plasma is used in various types of industrial processes in the semiconductor and 
printed wiring board industries, as well as in various other industries such as in the 
medical equipment and automotive industries. One common use of plasma is for etching 

20 away materials in an isolated or controlled environment. Various types of materials may 
be etched by one or more plasma compositions, including glasses, silicon or other 
substrate materials, organics such as photoresist, waxes, plastics, rubbers, biological 
agents, and vegetable matter, and metals such as copper, aluminum, titanium, tungsten, 
and gold. Plasma is also utilized for depositing materials such as organics and metals 

25 onto an appropriate surface by various techniques, such as via chemical vapor deposition. 
Sputtering operations may also utilize plasmas to generate ions which sputter away 
material from a source (e.g., metals, organics) and deposit these materials onto a target 
such as a substrate. Surface modification operations also use plasmas, including 
operations such as surface cleaning, surface activation, surface passivation, surface 

30 roughening, surface smoothing, micromachining, hardening, and patterning. 

Plasma processing operations can have a significant effect on a company's profit 
margin. This is particularly true in the semiconductor and printed wiring board industries. 
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Consider that a single semiconductor fabrication facility may have up to 200-300 
processing chambers and that each processing chamber in commercial production may 
process at least about 15-20 wafers per hour. Further consider that an eight inch wafer 
which is processed in one of these chambers in some cases may be used to produce up to 

5 600 individual semiconductor chips which are each worth at least about $125, and that 
each of these semiconductor chips are in effect "pre-sold." Therefore, a single wafer 
which has undergone an abnormal plasma process and which is scrapped can result in lost 
revenues of at least about $75,000. 

The particular plasma process which acts on the wafer such that a semiconductor 

10 device may be formed therefrom is commonly referred to as a plasma recipe. Plasma 
processes may be run on wafers in a commercial production facility in the following 
manner. A cassette or boat which stores a plurality of wafers (e.g., 25) is provided to a 
location which may be accessed by a wafer handling system associated with one or more 
processing chambers. One wafer at a time is processed in the chamber, although some 

15 chambers may accommodate more than one wafer at a time for simultaneous plasma 
processing. One or more qualification wafers may be included in each cassette, and the 
rest are commonly referred to as production wafers. Both the qualification and production 
wafers are exposed to the same plasma process in the chamber. However, no semi- 
conductor devices are formed from a qualification wafer as qualification wafers are 

20 processed and retained solely for testing/evaluating the plasma process, whereas 
semiconductor devices are formed from the production wafers. Further processing 
operations of these now plasma processed production wafers are required before 
semiconductor devices are actually formed from such production wafers. 

Monitoring is employed in connection with many plasma processes to evaluate 

25 one or more aspects of the process. One common monitoring technique associated with 
plasma recipes run on wafers is endpoint detection. Endpoint detection is concerned with 
identifying when one or more steps of a given plasma recipe is/are complete, or more 
specifically that point in time when the predetermined result associated with a plasma 
step has been produced on the product (e.g., when a layer of a multi-layered wafer has 

30 been completely removed in a manner defined by a mask or the like). Many endpoint 
detection techniques operate by identifying the point in time when the intensities of 
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particular wavelengths or spectral bands of optical energy emitted from the plasma 
processing chamber change. Such intensity changes result, for example, from a layer 
being completely etched away and material from a lower, different layer beginning to be 
removed and dispersed within the chamber, as well as, for example, various gases used in 
5 the plasma process no longer being consumed at the same rate when the layer is 
substantially removed. 

As such, typical endpoint detection techniques are not concerned with identifying 
abnormal conditions that may occur during the processing of a particular wafer nor 
evaluating trends that may be occurring within a processing chamber over time as 
1 0 multiple wafers are processed in accordance with a particular plasma recipe. Commonly 
used endpoint detection techniques provide no information on how the plasma process 
has actually proceeded or the "health" of the plasma process-only if and when an 
endpoint of the subject plasma step has been reached. Other monitoring techniques that 
are commonly used in plasma processes suffer from this same type of deficiency. 
15 Pressures, temperatures, and flow rates of the feed gases used to form the plasma are 
commonly monitored. Various aspects relating to the electrical system associated with 
the plasma are also monitored, such as the power settings being utilized since this will 
affect the behavior of the plasma. However, these types of monitoring operations do not 
necessarily provide an indication of how the plasma process is actually proceeding. All of 
20 the "hardware" settings may be correct, but still the plasma may not be performing 

properly for a variety of reasons (e.g., an "unhealthy" plasma). Since errors in a plasma 
process are typically detected by some type of post processing, destructive testing 
technique, multiple wafers are typically exposed to the faulty plasma process before the 
error is actually identified and remedied resulting in many wafers that need to be 
25 scrapped at great cost. 

In addition to endpoint detection techniques, various techniques are known for 
monitoring the "health" of a plasma process as it is performed on a group of wafers. One 
such plasma health monitoring technique relies on pattern recognition techniques to 
determine if optical spectra from the processing chamber match at least one previously 
30 stored "normal" or "healthy" spectrum. As may be appreciated, such techniques can be 
quite computationally intensive due to the amount of spectral data involved and also 
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require the establishment of normal spectra which may be searched for a match. In one 
known plasma process monitoring technique, principal component analysis (PCA) has 
been used to reduce the amount of spectral data that must be processed to a more 
manageable size. 

5 

SUMMARY OF THE INVENTION 
Accordingly, one object of the present invention is to provide a more accurate 
indication of how an entire industrial process, such as a plasma process, has proceeded on 
a more "real-time" basis, and thereby allow for a reduction in the number of product units 

10 (e.g., semiconductor wafers) that are exposed to abnormal processes. 

Another object of the present invention is to provide for the identification of the 
existence of an error in an industrial process, such as a plasma process, at least before the 
next product unit (e.g., a semiconductor wafer) is exposed to such an abnormal process. 
In accordance with the present invention, a method for monitoring an industrial 

15 process involves obtaining sensor data corresponding to a plurality of product units as the 
industrial process is performed on the product units, building one or more models of the 
industrial process from the sensor data as the sensor data is obtained, performing a 
principal components analysis on the model(s) to identify significant features present in 
the sensor data included in the model(s), using the results of the principal components 

20 analysis to generate residual data vectors representing the difference between expected 
sensor data for a product unit and the actual sensor data for a product unit, and obtaining 
information from the residual data vectors relating to the industrial process. 

In accordance with the present invention, a system for use in monitoring industrial 
processes includes a data selection module for obtaining one or more data scans for each 

25 product unit as an industrial process is performed on a group of product units, a sample 
matrix building module for building one or more sample matrices of the process from the 
data scans as the scans are obtained with each sample matrix and principal singular 
vectors thereof comprising a model, a PCA analysis module for performing a principal 
components analysis on the model(s) to identify significant features present in the data 

30 included in the model(s) and using the results of the principal components analysis to 
generate residual data vectors representing the difference between an expected data scan 
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for a product unit and an actual data scan for a product unit, a model maintenance module 
operable to identify a process and store and retrieve model data associated with the 
identified process, and a product unit categorization module for categorizing each product 
unit based on information (e.g., a scalar metric) obtained from the residual data vectors. 

5 In one embodiment of the present invention, the industrial process is a plasma 

process, the product units are semiconductor wafers, and the sensor data obtained 
comprise one or more optical emissions spectra (OES) data scans (also referred to herein 
as data vectors) for each wafer. In other embodiments, the industrial process may 
comprise other processes used in various industries such as the semiconductor, 

10 automotive, or pharmaceutical industries, for example, a deposition process, an ion- 
implant process, a micro-machining process, a wet cleaning process, a wet plating 
process, a plastic forming process, a solvent cleaning process, a painting process, or a 
coating process, and the sensor data may, for example, comprise electrical sensor data, 
radio-frequency (RF) sensor data, or residual gas analysis (RGA) mass spectrometer data. 

15 According to one aspect of the present invention, two types of OES scan data are 

obtained for each wafer. A first type of data comprises a particular OES scan taken at a 
particular sampling time during processing of the wafer. Such scan may be taken at the 
endpoint of the plasma process that is performed on the wafer. The second type of data 
comprises an average of a selected portion of the OES scans taken for the wafer. In this 

20 regard, OES scans taken for the wafer after the plasma process has stabilized and prior to 
it losing stability near the end of the process may be averaged to obtain the average OES 
scan for a wafer. 

According to another aspect of the present invention, four categories of models 
are built for each plasma process recipe and for each processing chamber. A first model 

25 is comprised of the first type of data and such model is not updated after it is complete. 
A second model is comprised of the second type of data and such model is not updated 
after it is complete. A third model is comprised of the first type of data and such model is 
updated after it is complete. A fourth model is comprised of the second type of data and 
such model is updated after it is complete. In this regard, all four models may be 

30 considered complete when a predetermined number of OES scans are included in the 

models. The third and fourth model may be updated by deleting scans corresponding to a 
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wafer processed earlier in time and adding scans corresponding to a wafer processed later 
in time. 

According to a further aspect of the present invention, unique models are 
maintained corresponding to a multiplicity of recipes being run in a particular processing 
5 chamber, and also corresponding to one of several processing chambers being monitored. 
It is typical for the wafers in a cassette to be distributed by the wafer handling system to 
different chambers. As the sensor data from different chambers may contain different 
characteristics, but each is nonetheless "normal" for that chamber, the present invention 
accounts for chamber-to-chamber variability by maintaining separate models. Also, the 
10 process recipe, determined by the requirement for each incoming wafer, may change. 
The present invention associates unique models (one of each category) with the recipe in 
use, and changes the models in use when the recipe changes. 

According to a further aspect of the present invention, the residual spectrum 
vectors for each model are reduced to scalar values in order to obtain a metric that 
1 5 characterizes the plasma process. In this regard, the standard deviation of the data in 
each residual spectrum vector is calculated. In a related aspect, the standard deviations 
are normalized by dividing the single-wafer standard deviation by an average standard 
deviation for the process. 

According to yet another aspect of the present invention, the normalized standard 
20 deviations of the residual spectrum vectors are utilized to assess whether the plasma 
process has been properly performed for each wafer. In this regard, each wafer may be 
assigned to one of three categories (e.g., normal, warning, failure) based on how the 
normalized standard deviation relating thereto compares with a value of unity. 

According to an additional aspect of the present invention, the category assigned 
25 to each wafer may be utilized to assess the validity of an endpoint of a plasma-processing 
step that has been determined by a separate endpoint determination technique. In this 
regard, if the plasma process has been determined to have failed for a particular wafer, 
then the endpoint may be considered to be improperly determined. If the plasma process 
has been determined to be in the warning category for a particular wafer, then the 
30 endpoint may be considered to be questionable and in need of further assessment. If the 
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plasma process has been determined to be normal for a particular wafer, then the 
endpoint may be considered to be reliable. 

These and other aspects and advantages of the present invention will be apparent 
upon review of the following Detailed Description when taken in conjunction with the 
5 accompanying figures. 

DESCRIPTION OF THE DRAWINGS 
For a more complete understanding of the present invention and further 
advantages thereof, reference is now made to the following Detailed Description, taken in 
10 conjunction with the drawings, in which: 

FIG. 1 is a schematic diagram of one embodiment of a plasma monitoring 
assembly in which a plasma process fault detection module in accordance with the 
present invention may be implemented; 

FIG. 2 is a block diagram providing an overview of various sub-modules included 
1 5 in one embodiment of a plasma fault detection module in accordance with the present 
invention; 

FIGS. 3A-D are flowcharts showing various information-processing steps 
included in one embodiment of a plasma process fault detection module in accordance 
with the present invention; 
20 FIG. 4A is a plot showing exemplary normalized standard deviation values 

determined from a dynamically updated model obtained by the plasma process fault 
detection module for wafers that have undergone a plasma process; and 

FIG. 4B is a plot showing exemplary normalized standard deviation values 
determined from a static model obtained by the plasma process fault detection module for 
25 wafers that have undergone a plasma process; 

DETAILED DESCRIPTION 
Plasma Monitoring Assembly 
Referring to FIG. 1, there is shown one embodiment of an assembly for 
30 monitoring/evaluating plasma processes (hereafter "plasma monitoring assembly 10") 
that may be used to collect optical emissions spectra (OES) data from a plasma process 
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conducted within a plasma processing chamber 12 of a wafer production system. In other 
embodiments, sensors other than those of OES may be used. The plasma monitoring 
assembly 10 operatively interfaces with a window 14 of the plasma processing chamber 
12 by receiving optical emissions of the plasma which pass out of the processing chamber 

5 12 through the window 14. These optical emissions are "collected" by an appropriate 
fiber optic cable 16, one end of which is positioned at or near the outer surface 18 of the 
window 12. Optical emissions of the plasma within the processing chamber 12 during 
processing of a wafer therein enter the fiber optic cable 16 and are directed to a 
spectrometer assembly 20. Both scanning-type and solid state spectrometers may be used 

10 as the spectrometer assembly 20. The spectrometer assembly 20 may also include one or 
more appropriately interconnected spectrometers, each of which obtains optical emissions 
data from a different region. The spectrometer assembly 20 separates these optical 
emissions into a plurality of individual wavelengths and, in one embodiment, provides 
these separate optical components to an array 22 of charge coupled devices (hereafter 

15 "CCD array 22") for conversion to a corresponding computer-readable electrical signal. 
The computer-readable electrical signal is provided by the CCD array 22 to a plasma 
monitor control unit 24 (hereafter "PMCU 24") which is the primary control mechanism 
of the plasma monitoring assembly 10. In one embodiment, the PMCU 24 is a computer 
which may be configured to include, but not limited to, at least one motherboard, at least 

20 one analog-to-digital conversion board, at least one central processing unit (CPU) for 
each motherboard, and one or more types of computer-readable storage mediums such as 
at least one floppy disk drive, at least one hard disk drive, and at least one optical media 
drive (e.g., a CDR, CDRW, DVDR or DVDRW drive). Other hardware may be 
operatively interconnected with the PMCU 24, such as a display 26 for providing 

25 visual/audio-based information to operations personnel (e.g., a CRT, LCD, or computer 
monitor), as well as one or more data entry devices 28 (e.g., mouse, light pen, keyboard) 
for allowing personnel to enter information used by or relating to the plasma monitoring 
assembly 10. One PMCU 24 may be provided for each chamber 12, or the PMCU 24 may 
be configured to service multiple chambers 12. The PMCU 24 is also operatively 

30 interfaced or interconnected with the main control unit 30 (hereafter "MCU 30") of the 
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wafer production system including the plasma processing chamber 12 such that the 
PMCU 24 and MCU 30 may communicate with each other. 

The PMCU 24 is operable to execute the instructions of a plasma process fault 
detection module 100. The PMCU 24 may also be operable to execute other modules, 

5 such as, for example a plasma process endpoint determination module (not shown) useful 
in determining when the endpoint(s) of particular plasma process step(s) are reached. 
The plasma process fault detection module 100 (and other modules, if any) may be stored 
on a computer-readable storage medium associated with the PMCU 24 (e.g., on a 
portable computer diskette(s), on a hard drive, on a CD(s), on a DVD(s)). 

10 The plasma process fault detection module 100 facilitates the monitoring or evaluation of 
the various types of plasma processes which may be conducted within the processing 
chamber 12 through the evaluation of OES data from the plasma in the chamber 12. In 
the case of plasma monitoring assembly 10 of the FIG. 1, OES data are collected and 
delivered by the fiber optic cable 16 to the spectrometer assembly 20 which divides the 

15 light up into its individual optical components. Data representative of these optical 

emission components are then made available to the PMCU 24 and thereby to the plasma 
process fault detection module 100 via the CCD array 22 as described above. 

Evaluation or monitoring of the current plasma process through the plasma 
process fault detection module 100 is facilitated by collecting optical emissions from the 

20 plasma preferably to include at least wavelengths from within the UV range to within the 
near infrared range, and thereby including the visible light spectrum. In one embodiment, 
optical emissions of the plasma in the processing chamber 12 which are obtained and 
available for evaluation include at least those wavelengths from about 200 nanometers to 
about 1,000 nanometers (inclusive), and more preferably at least those wavelengths from 

25 about 150 nanometers to about 1,200 nanometers (inclusive). Hereafter the above-noted 
desired range or bandwidth of optical emissions data which are obtained/collected from 
the plasma in the chamber 12, and which includes each of the above-noted ranges or 
bandwidths, will be referred to as the "Preferred Optical Bandwidth." 

. Optical or wavelength resolutions within and throughout the Preferred Optical 

30 Bandwidth are preferably no more than about 1 nanometer, and even more preferably no 
more than about 0.5 nanometers. The term "wavelength resolution" in this context means 
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the amount of separation between adjacent wavelengths in the subject optical emissions 
data that are collected. Therefore, if the wavelength resolution being used to collect 
optical emissions data from the plasma in the chamber 12 is 1 nanometer, no more than a 
1 nanometer spacing will exist between any two data points within and throughout the 
5 Preferred Optical Bandwidth. It is typical, due to the optical characteristics of the 
spectrometer, for the wavelength resolution to vary fractionally over the range of the 
Preferred Optical Bandwidth. Hereafter, the above-noted magnitude for the optical or 
wavelength resolution will be referred to as the "Preferred Data Resolution; 1 meaning 
that at no point in the Preferred Optical Bandwidth does the resolution exceed this value. 

10 Another factor relating to the effectiveness of the plasma process fault detection 

module 100 is the frequency with which OES data is obtained from the plasma in the 
chamber 12. OES data are preferably obtained at least once per every second, and more 
preferably more frequently than once per every second from the plasma in the chamber 
12. Although equal time intervals will typically be utilized in relation to the times at 

15 which OES data is collected from the plasma in the chamber 12, this need not be the case 
such that equal time intervals, unequal time intervals, and combinations thereof may be 
utilized. Hereafter, the above-noted sampling frequencies for obtaining OES data from 
the plasma in the chamber 12 will be referred to as the "Preferred Sampling Frequency". 
The spectrometer assembly 20 illustrated in FIG. 1 should be capable of meeting 

20 the above-noted criteria for the Preferred Optical Bandwidth, Preferred Data Resolution 
and Preferred Sampling Frequency, and a number of implementations may be utilized. 
For instance, the spectrometer assembly 20 may consist of an optical dispersive element 
such as a grating and a solid state device consisting of a linear array of from 512 to 2048 
detection elements. Alternatively, multiple solid-state linear arrays may be oriented in 

25 the dispersive pattern to fully cover the Preferred Optical Bandwidth while achieving the 
Preferred Data Resolution. The spectrometer assembly 20 may also be of the scanning 
type in which the dispersive element is moved such as to scan the spectrum across a 
single detector to obtain data encompassing the Preferred Optical Bandwidth using the 
Preferred Data Resolution and at the Preferred Sampling Frequency. 

30 Regardless of how it is generated, the raw OES data for a wafer includes multiple 

scans or vectors of values representing the intensity of selected wavelengths as 

10 
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determined by the Preferred Data Resolution within the Preferred Optical Bandwidth 
taken at the sampling times determined by the Preferred sampling frequency. 

Plasma Fault Detection Module 

5 Referring now to FIG. 2, there is shown a block diagram providing an overview 

of the plasma process fault detection module 100 and a number of sub-modules or 
functions included in the plasma process fault detection module 100. In the present 
embodiment, the plasma process fault detection module 100 is implemented as a 
computer software program that is executable by the microprocessor of the PMCU 24. In 

10 other embodiments, the plasma fault detection module 100 might be implemented in 

hardware such as, for example, an application specific integrated circuit (ASIC) or a field 
programmable gate array (FPGA). The plasma process fault detection module 100 
employs techniques of principal component analysis (PCA) in order to evaluate OES data 
obtained for a plasma process performed on a group of wafers and identify potential 

15 faults that may have occurred during the plasma process. 

The plasma process fault detection module 100 includes a data selection sub- 
module 101, a sample-matrix building/updating sub-module 102, a PCA analysis sub- 
module 103, a model maintenance sub-module 104, a wafer categorization sub-module 
105, and a data output sub-module 106. The data selection sub-module 101 is 

20 responsible for obtaining selected OES data for each wafer that is processed. In general, 
selected sensor data that are obtained are referred to herein as a data vector, record or 
scan. In the present embodiment where the sensor data are OES data, the data vector is 
referred to herein as a spectrum vector. The OES data that are selected for a particular 
wafer may, for example, be just one scan taken at a particular sampling time, or it may, 

25 for example, be an average scan of a number of separate scans taken at multiple sampling 
times. In the present embodiment, two scans are obtained for each wafer. The sample- 
matrix building/updating sub-module 102 is responsible for constructing multiple 
matrices composed of multiple columns of data vectors from the selected OES data for a 
number of wafers. In the present embodiment, four sample matrices are constructed for 

30 each identified process. A process is identified by the chamber in which the processing 
takes place and by the specific process recipe being employed in the chamber. The 
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sample matrices are representative of OES data expected from a wafer in which the 
identified plasma process has proceeded as expected. One or more of the sample 
matrices may be updated by the model building/updating sub-module 102 when more 
OES data becomes available as additional wafers are processed. In the present 
embodiment, two of the four sample matrices for each identified process are updated. 
The PC A analysis sub-module 103 computes the principal singular vectors from the 
sample matrices and utilizes PCA techniques to determine whether the selected OES data 
for a particular wafer differs significantly from that expected for a normal wafer as 
represented by the models. In this regard, PCA techniques are utilized to generate a 
predicted OES scan that is compared with the actual OES scan to obtain a residual 
spectrum vector. The model maintenance sub-module 104 saves and retrieves each 
model, consisting of a sample matrix and the corresponding principal singular vectors, to 
ensure that the data for the current wafer under process is associated with the correct 
identified process. In this regard, the saving and retrieving may be done from computer 
memory, or from a data storage device such as a hard disk drive. The wafer 
categorization sub-module 105 categorizes each wafer based on the residual spectrum 
vector. In the present embodiment, the residual spectrum vector is reduced to a scalar 
value, that is in turn normalized, and the wafers are categorized based on the normalized 
scalar values corresponding their respective residual spectrum vectors. The wafers may, 
for example, be categorized as being within a first range wherein the plasma process 
proceeded normally, within a second range considered to be an indication or warning that 
something may have been wrong with the plasma process, or a third category considered 
to be an indication that the plasma process has failed. The data output sub-module 106 is 
responsible for outputting the results (e.g., the normalized scalar values corresponding to 
the residual spectra, the category assigned to a wafer) to a user, for example on the 
display 26 of the PMCU 24. The data output sub-module 106 may also save the results 
obtained to a storage medium for subsequent processing and analysis. As may be 
appreciated, the responsibilities of the various sub-modules may overlap and an action 
undertaken by one sub-module may cause one or more other sub-modules to undertake 
particular actions. 
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Referring now to FIGS. 3A-D, there is shown a flow diagram illustrating the 
process steps undertaken by one embodiment of the plasma process fault detection 
module 100 and the various sub-modules thereof. 

The plasma process fault detection module 100 procedure begins with step 1 10. 

5 In step 1 10, various parameters are initialized. Parameters that are initialized in step 1 10 
may be of two types, non-selectable parameters that are predetermined in advance of the 
plasma process fault detection module 100 being run and user input parameters that are 
received (e.g., from the data entry device 28 of the PMCU 24) from a user of the plasma 
process fault detection module 100 when it is started. One such input parameter is the 

10 model size. Model size is the number of wafers for which data is included in the models 
that are constructed. In the present embodiment, the desired model size of the plasma 
process fault detection module 100 must be within predetermined lower and upper limits. 
Furthermore, while the model size for each recipe and chamber may differ, the present 
embodiment uses the same model size for each category of model for the same process 

15 and chamber in order to avoid subsequent analysis and interpretation problems that may 
be present in trying to analyze and compare the results from, for example, a model based 
on only two wafers versus a model based on one-hundred wafers. Other parameters may 
include specified limits for use in categorizing each wafer. In the present embodiment, 
such lower and upper limits for model size are 4 and 100, respectively, but in other 

20 embodiments these limits may be different or even not included. As may be appreciated, 
in other embodiments, some of the user input parameters may instead be non-selectable 
parameters, and some of the non-selectable parameters may instead be user input 
parameters. When step 1 10 is complete, the plasma process fault detection module 100 
proceeds to step 120. 

25 In step 120, selected data is obtained from raw OES data relating to a particular 

wafer i that has undergone a plasma process. In this regard, the raw OES data may have 
been generated using, for example, a plasma monitoring assembly 10 as illustrated in 
FIG. 1, and saved on the hard drive of the PMCU 24. Step 120 may incorporate a listener 
instruction that waits until raw OES data is available for wafer i. Step 120 may also 

30 incorporate an interrupt instruction that checks to see if a user of the plasma process fault 
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detection module 100 has requested the plasma process fault detection module 100 to 
stop (e.g., if plasma processing of wafers has been halted). 

Once raw OES data is available for wafer i, the plasma process fault detection 
module 100 accesses the saved raw OES data and obtains two records or scans of OES 

5 data therefrom. The first scan is referred to as the endpoint or EP scan. The EP scan 
comprises the OES data vector obtained at the sample time when endpoint of the plasma 
process is determined to have occurred (e.g. by an endpoint determination module), 
which may or may not be the last vector in the raw OES data. The EP scan is not 
necessarily used for real time endpoint determination, but rather is used to compare 

10 normal and anomalous scans for endpoint verification after the fact (i.e., after endpoint 
has been determined in accordance with a separate endpoint determination module). The 
second scan is referred to as the non-endpoint or nonEP scan. The nonEP scan comprises 
the average of a subset of the multiple raw OES data vectors for the wafer. In the present 
embodiment, the middle 60 to 80 percent of the raw OES data vectors for a wafer are 

15 averaged to obtain a single average OES data vector that comprises the nonEP scan. For 
example, where the raw OES data includes 100 raw OES data vectors (e.g., for a 100 
second long plasma process sampled once per second), raw OES data vectors 20 through 
80 (the middle 60 percent) may be averaged to obtain the nonEP scan, raw OES data 
vectors 10 through 90 (the middle 80 percent) may be averaged to obtain the nonEP scan, 

20 or some other subset of the 100 raw OES data vectors may be averaged to obtain the 
nonEP scan for a wafer. Averaging raw OES data to obtain a single nonEP scan 
increases the signal to noise ratio in the data by reducing the effect of random noise in the 
data. Averaging only a subset of the raw OES data vectors to obtain a single nonEP scan 
permits OES data near the beginning and the end of a plasma process when the plasma 

25 may not be stable to be ignored in obtaining the nonEP scan. Averaging the raw OES 
data to obtain a single nonEP scan also provides advantages in rotational type processing 
tools wherein the processing plasma experiences a cyclical variation. Each of the EP and 
nonEP scans are assigned to respective vectors Xi(EP) and Xi(nonEP), which hereafter 
may be referred to as just x\. 

30 After obtaining the current EP and nonEP scans Xj, it is determined in step 130 

whether the current wafer index i is greater than 1. If it is determined that wafer index i 
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is not greater than 1 then the plasma process fault detection module 100 proceeds to step 
140. In step 140, sample matrices X m are created from the appropriate first EP and 
nonEP scan vectors Xi. In the present embodiment, a total of four sample matrices X m 
are created for each group of wafers that is processed, but in other embodiments, more or 
5 fewer sample matrices X m may be created. Subscript m is the model index, and in the 
present embodiment m equals 1, 2, 3 or 4. Each sample matrix X m corresponds with one 
of four models that are referred to herein as the static EP model, the static nonEP model, 
the dynamic EP model, and the dynamic nonEP model. Although the order of the models 
is not significant, for subsequent reference purposes, sample matrix Xj corresponds with 

1 0 the static EP model, sample matrix X 2 corresponds with the static nonEP model, sample 
matrix X 3 corresponds with the dynamic EP model, and sample matrix Xt corresponds 
with the dynamic nonEP model. In the description that follows, the sample matrix for a 
particular model may simply be identified as X m . Each of the static and dynamic EP and 
nonEP sample matrices X m includes OES intensities at wavelength index values l:nl and 

1 5 wafer index values 1 :nw (when complete), where nl is the number of wavelengths 
determined by the Preferred Data Resolution as applied to the Preferred Optical 
Bandwidth, and nw is the number of wafers for which data is included in the models (i.e., 
nw equals the desired model size). After step 140, the plasma process fault detection 
module 100 proceeds to step 340. 

20 If it is determined in step 130 that the current wafer index i is greater than 1, then 

the plasma process fault detection module 100 proceeds to step 150. In step 150, the 
process, identity that wafer i is undergoing is determined and the data (X m , U m ) for each 
model associated with this process identity is read (e.g., from hard disk or random-access 
memory included in the PMCU 24). In the preferred embodiment, the system keeps track 

25 of multiple models for each of a plurality of processing chambers being monitored and 
for each of a plurality of recipes for each chamber. Matrix U m is obtained for each model 
as described below in connection with step 320. After reading the model data, the 
plasma process fault detection module 100 proceeds to step 160. 

In step 160 it is determined whether the sample matrix X m is complete for each 

30 model m. A model is considered complete when the number of individual scan vectors x; 
in matrix X m equals the desired model size. If it is determined in step 160 that a model is 



Express Mail No.: EV338034347US PATENT APPLICATION 

Attorney Docket No. 44500-00342 

not complete, step 170 is performed. In step 170, the current EP or nonEP scan vector \ { 
as appropriate for a particular model is added to sample matrix X m for each model that is 
not complete. Thereafter, it is determined in step 180 whether the current wafer index i 
equals the model size for a particular model If it is determined in step 180 that the 
current wafer index i equals the model size (in which case sample matrix X m has been 
completed for the first time) then the plasma process fault detection module 100 proceeds 
to step 320. If it is determined in step 180 that the current wafer index i does not equal 
the model size (in which case sample matrix X m for a particular model is not yet 
complete) then the plasma process fault detection module 100 proceeds to step 340. 

If it is determined in step 160 that sample matrix X m for each model m is 
complete, then the plasma process fault detection module 100 proceeds to step 190. In 
step 190, principal component vectors pi corresponding with each scan Xj are computed 
for each model. In this regard, the principal components pi are computed in accordance 
with the following expression: 

Pi = U m T * Xi (1) 

where superscript T indicates matrix transpose, * indicates matrix multiplication, i is the 
wafer index (e.g., i =1,2, 3, etc.) and m is the model index (e.g. m = 1, 2, 3, or 4 in the 
case of the present embodiment). It will be appreciated that step 190 is not performed for 
a particular model until it is determined in step 160 that its sample matrix X m is complete. 
However, once the model size is reached and U m is obtained for a particular model in step 
320, the principal components pi corresponding with each scan x\ in the model may be 
computed. In computing the principal components pi for the EP and nonEP models, 
matrix U m is determined once initially for static models, and is periodically updated for 
dynamic models. Conditions for updating U m and the calculation of U m are described for 
steps 310 and 320, below. After step 190, the plasma process fault detection module 100 
proceeds to step 200. 

In step 200, a predicted spectrum vector xpi is computed for each model based on 
the principal components pi and the current matrix U m . In this regard, the predicted 
spectrum is computed in accordance with the following expression: 
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(2) 



where * indicates matrix multiplication, i is the wafer index number (e.g., i = 1, 2, 3, 
5 etc.) and m is the model index (e.g. m = 1 , 2, 3, or 4 in the case of the present 

embodiment). In computing the predicted spectrum xpi for the static EP and nonEP 
models, matrix U m does not change after initial creation. In computing the predicted 
spectrum xpi for the dynamic EP and nonEP models, matrix U m is the periodically 
updated matrix for each such model , as will be described below for step 310. After step 
10 200, the plasma process fault detection module 100 proceeds to step 210. 

In step 210 a residual spectrum vector rj is computed for each model as the 
difference between the actual spectrum vector x\ and the predicted spectrum vector xpi in 
accordance with the following expression: 



where i is the wafer index number (e.g., i = 1, 2, 3, etc.). After step 210, the plasma 
process fault detection module 100 proceeds to step 220. 

In step 220, the residual spectrum vector rj for each model is reduced to a scalar 
20 value by calculating the standard deviation of the residual spectrum vector iy. 



where i is the wafer index number (e.g., i = 1, 2, 3, etc.) and k is the index number of the 
25 wavelength channel of the spectrometer. After step 220, the plasma process fault 

detection module 100 proceeds to step 230. , 

In step 230, the standard deviations 0\ obtained in step 220 for each model are 

normalized to obtain normalized standard deviation values anoraii. Normalization is 

accomplished by dividing each scalar standard deviation value o x for a particular model 
30 by an average standard deviation value a aV erage for that model. In the present 

embodiment, the average standard deviation value a av erage is first obtained for each model 
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(4) 
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by averaging a first set of standard deviations comprised of the standard deviation for the 
first wafer through the standard deviation for the wafer having an index value i equal to 
the model size, and this average standard deviation is used to normalize each of the 
standard deviation values 0\ up to the model size. Thereafter, the average standard 
deviation value a aV erage is obtained for each model by averaging a second set of standard 
deviations comprised of the standard deviation for the wafer having an index value i 
equal to the model size plus one through the standard deviation for the wafer having an 
index value i equal to the model size plus one-half of the model size, and this average 
standard deviation is used to normalize each of the standard deviation values 0\ for the 
wafers having index value i greater than the model size. For the purpose of normalizing, 
only "normal" wafers are utilized. For example, where the model size selected by the 
user is 100 wafers, the standard deviations 0\ for wafers 1 through 100 are normalized by 
dividing each such value by: 

Average = Ol+ <*2 + + <TlOo) / 100 (5) 

Thereafter, the standard deviations o x for wafers 101 on are normalized by dividing each 
such value by: 

^average = O101+ tfio2 + - + ^iso) / 50 (6) 

As may be appreciated the average standard deviation value a av erage used to normalize the 
standard deviation values a aV era g e obtained for each model may be computed in many 
other manners. Normalization of the standard deviation values 0\ for each model scales 
the results of each model so that a "normal" standard deviation value for each model is 
one, thereby allowing the results of each model to more easily be compared. 
Normalization also facilitates comparison of the results obtained for a particular process 
run in a particular chamber with the same process run in a different chamber on the same 
or different tools or the same process run in the same chamber at a later time. After step 
230, the plasma process fault detection module 100 proceeds to step 240. 
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In step 240, the normalized standard deviation values anornii are utilized to 
identify whether a fault has occurred in the plasma process performed on a wafer. In the 
present embodiment, this categorization step 240 is only applied to the normalized 
standard deviation values anornii of the dynamic EP and nonEP models. In this regard, 

5 each normalized standard deviation value anonrii for a particular dynamic model is 
categorized based upon a specified standard into one of three classes: 1) "normal"; 2) 
"warning"; and 3) "failure". The standard is based upon the particular normalized 
standard deviation value anormj. If the value is less than or equal to a first specified 
amount, the wafer is categorized as being in the "normal" category. If anormj is greater 

10 than the first specified amount but less than or equal to a second specified amount, the 
wafer may be categorized as being in the "warning" category. If anorm, is greater than 
the second specified amount, the wafer may be categorized as being in the "failure" 
category. The first and second specified amounts may be predetermined or they may be 
input by a user. A typical first amount is 2, and a typical second amount is 3. As may be 

15 appreciated, fewer or more categories may be defined based upon similar or different 

standards. After step 240, the plasma process fault detection module 100 proceeds to step 
250. 

In step 250, messages indicating that a particular wafer has been categorized as 
being within a particular category are generated. For example, when the wafer is 

20 categorized as "normal", a message indicating that such wafer is "normal" (or no 

message at all) may be provided, for example, on the display 26 of the PMCU 24. When 
the wafer is categorized as "warning", a "warning" message may be provided, for 
example, on the display 26 of the PMCU 24. When the wafer is categorized as "failure", 
a "failure" message may be provided, for example, on the display 26 of the PMCU 24. 

25 Where the PMCU 24 is attached to a local-area-network (LAN), "warning" and "failure" 
messages may also be emailed to user-defined parties in order to alert such parties to the 
"warning" and "failure" conditions. A user that has received a "warning" or "failure" 
message might undertake efforts to check on the process being run or halt such process 
before additional wafers are processed resulting in abnormal wafers that may need to be 

30 scrapped. Although automatic control of the plasma process is not implemented in the 
present embodiment, in other embodiments, in the case of "warning" and "failure" 
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messages, the PMCU 24 may also be prompted to automatically send an instruction to the 
MCU 30 to automatically check/adjust process parameters in the case of a "warning" 
message or halt the process in the case of a "failure" message. After step 250, the plasma 
process fault detection module 100 proceeds to step 260. 

5 In step 260, the information generated by the plasma process fault detection 

module 100 for wafer i is output to a user. In this regard, the information may be output 
on, for example, the display 26 of the PMCU 24. Information that is output may include 
the normalized standard deviation value anormi for each model as well as the category 
message generated in step 250 based its normalized standard deviation value crnormj. In 

10 this regard, four normalized standard deviation values anornij, one for each model, are 
output, and two category messages, one for each of the dynamic models, are output. The 
normalized standard deviation value anormi for each model may be output both in the 
form of a number displayed on the screen as well as an ongoing plot that is updated each 
time a new normalized standard deviation value anormi is calculated. After step 260, the 

15 plasma process fault detection module 100 proceeds to step 270. 

In step 270, it is determined whether wafer i is normal. Wafer i is deemed to be 
normal if it is categorized as being within the "normal" category in step 240. If it is 
determined that wafer i is normal then the plasma process fault detection module 
proceeds to step 280. If it is determined that wafer i is not normal (e.g., it has been 

20 categorized as being within the "warning" or "failure" categories), then the plasma 
process fault detection module 100 proceeds to step 350. 

In step 280 it is determined whether sample matrix X m corresponds with one of 
the dynamic models. If it is determined in step 280 that sample matrix X m does not 
correspond with a dynamic model (i.e., it corresponds with one of the static models), then 

25 the plasma process fault detection module 100 proceeds to step 350. If it is determined in 
step 280 that sample matrix X m does correspond with one of the dynamic models, then 
the plasma process fault detection module 100 proceeds to step 290. 

In step 290 the earliest scan vector Xj (vector Xj having the lowest index i) 
included in sample matrix X m corresponding to each of the dynamic EP and nonEP 

30 models (i.e. matrices X3 and X4) is deleted from its respective dynamic model sample 

matrix X m . Thereafter, in step 300, the current EP and nonEP scan vectors \\ are added to 
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the ends of the sample matrix X m corresponding to each of the dynamic EP and nonEP 
models, respectively. As may be appreciated, updating of the dynamic models occurs in 
accordance with a first in first out (FIFO) scheme. After updating the dynamic models in 
steps 290 and 300, the plasma process fault detection module 100 proceeds on to step 
310. 

In step 310 it is determined whether conditions required in order to recalculate the 
U m matrix for each of the dynamic model matrices X 3 and X4 are met. In this regard, 
each time that a desired portion of the data in the sample matrices X 3 and X4 for the 
dynamic models has been updated in steps 290 and 300, recalculation of U m for the 
dynamic models may be undertaken. In the present embodiment, each time it is 
determined in step 310 that half of the data in the dynamic EP and nonEP model sample 
matrices X 3 and X4 has been updated, the conditions required for recalculating U m for 
each dynamic model matrix X 3 and X4 are met. By way of example, where the dynamic 
model size is 100, each time a total of 50 scans have been deleted and 50 new scans 
added (i.e. after scan 150 is added, after scan 200 is added, after scan 250 is added, etc.), 
the conditions required for recalculating U m are met. As may be appreciated, the 
conditions required in step 310 may differ (e.g., when a third or a quarter of the scans 
have been updated recalculation may be undertaken). If it is determined in step 310 that 
the conditions required to recalculate U m for each of the dynamic model matrices X 3 and 
X4 are not met, then the plasma process fault detection module 100 proceeds to step 340. 
If it is determined in step 310 that the conditions required to recalculate the principal 
components for each of the dynamic model matrices X 3 and X4 are met, then the plasma 
process fault detection module 100 proceeds to step 320. 

In step 320, once a sample matrix X m for a particular model is completed for the 
first time (i.e. when the plasma process fault detection module has proceeded from step 
180 to step 320 because it is determined in step 180 that wafer index i equals the model 
size) a singular value decomposition (SVD) is performed on X m for such model to obtain 
matrices U, S and V satisfying the relationship: 



Xm — Un 



(7) 
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where superscript T indicates matrix transpose, * indicates matrix multiplication, and m 
is the model number (e.g., m = 1, 2, 3, or 4 in the case of the present embodiment). In 
performing the singular value decomposition on X m , a thin SVD may be performed in 
order to reduce the amount of computations that are required. In this regard, a thin SVD 
5 is appropriate when the number of columns in X m (given by nw) is substantially less than 
the number of rows in X m (given by nl), as is the case for OES data. Since there are four 
sample matrices Xi, X 2 , X 3 , X4 in the present embodiment, four matrices Ui, U 2 , U 3 , U 4 
are obtained, as well as four matrices V u V 2 , V 3 , V 4 and four matrices S h S 2 , S 3 , S 4 . At 
this stage, each matrix U m is of dimension (nl, nl) and is a set of singular vectors for its 

10 corresponding sample matrix X m (essentially an orthogonal set of coordinates) ordered so 
that the first singular vectors, U m (l:nl, 1,2,3,...), lie in the "directions" (in the high 
dimensional wavelength space) of the largest variance in X m . In Step 330, the number of 
singular vectors kept in U m is reduced to a set of principal singular vectors representing 
the largest variance in X m , U m (l:nl, l:np), where np < nl. Various statistical criteria 

15 familiar to practitioners of principal components analysis may be used to decide on the 
size of the parameter np, the number of principal singular vectors to retain in the matrix 
U m . 

In steps 320 and 330, if the plasma process fault detection module 100 has 
proceeded from step 310 to step 320, then the SVD is only performed for the dynamic 

20 model matrices X 3 and X4 to obtain updated principal singular- vector matrices U 3 and U4 
for the dynamic models only. 

In either the initial case or the recalculation case, after the required principal 
singular-vector matrices U m have been determined in steps 320 and 330, the plasma 
process fault detection module 100 proceeds to step 340. 

25 In step 340, various data items (e.g., X m for each model, U m for each model, the 

normalized standard deviation values anormi for each model, the category assigned to 
each wafer for each of the dynamic models) are saved to a database file for subsequent 
analysis. The database file may, for example, be stored on the hard drive of the PMCU 
24. In the present embodiment, a user may specify whether the data is to be saved in a 

30 Microsoft SQL format or a Microsoft Access format. However, numerous other formats 
are possible as well. Once saved, an appropriate database program may access the 
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database file to perform functions such as querying, sorting and graphing the various data 
items and displaying wafer related information as well as the previously saved OES data 
obtained from a wafer. 

After various model data is saved in step 340 or if directed thereto from steps 270 
5 or 280, the plasma process fault detection module 100 undertakes step 350. In step 350 
the current wafer index i is increased (e.g., by adding 1 to the current wafer index i). 
Thereafter, the plasma process fault detection module 100 returns to step 120. 

Referring now to FIGS. 4A-B, there are shown plots of exemplary normalized 
standard deviations anornii for a dynamic model (FIG. 4A) and exemplary normalized 

10 standard deviations anormj for a static model (FIG. 4B) obtained by the plasma process 
fault detection module 100 for a number of wafers that have undergone the same plasma 
process within a plasma processing chamber 12. As can be seen by comparing the plots, 
anomalous wafers are more readily identified from the normalized standard deviations 
anormi from the dynamic model than from the static model. Further, the plot in FIG. 4B 

15 shows how over time as the processing chamber 12 becomes increasingly dirty, the static- 
model normalized standard deviations anormj may significantly increase. This 
information may be utilized to identify when it is time to clean the processing chamber 
12. 

While various embodiments of the present invention have been described in 
20 detail, further modifications and adaptations of the invention may occur to those skilled 
in the art. However, it is to be expressly understood that such modifications and 
adaptations are within the spirit and scope of the present invention. - 
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